Using the Triangle Inequality to Reduce the Number of Comparisons Required for Similarity-Based Retrieval
نویسندگان
چکیده
Dissimilarity measures, the basis of similarity-based retrieval, can be viewed as a distance and a similarity-based search as a nearest neighbor search. Though there has been extensive research on data structures and search methods to support nearest-neighbor searching, these indexing and dimension-reduction methods are generally not applicable to non-coordinate data and non-Euclidean distance measures. In this paper we reexamine and extend previous work of other researchers on best match searching based on the triangle inequality. These methods can be used to organize both non-coordinate data and non-Euclidean metric similarity measures. The eeectiveness of the indexes depends on the actual dimen-sionality of the feature set, data, and similarity metric used. We show that these methods provide signiicant performance improvements and may be of practical value in real-world databases.
منابع مشابه
Sparse Spatial Selection for Novelty-Based Search Result Diversification
Novelty-based diversification approaches aim to produce a diverse ranking by directly comparing the retrieved documents. However, since such approaches are typically greedy, they require O(n) documentdocument comparisons in order to diversify a ranking of n documents. In this work, we propose to model novelty-based diversification as a similarity search in a sparse metric space. In particular, ...
متن کاملConstrained Aggregate Similarity Queries in Metric Spaces
Abstract. The optimization of similarity queries using metric access methods has been widely discussed in the last decades. Similarity queries consider one object as the query center, and retrieve objects that are either far up to a radius or the nearest ones. Another important retrieval operation, less studied so far, is the Aggregate Similarity Query, which retrieves objects with the smallest...
متن کاملAn experimental study on the performance of visual information retrieval similarity models
This paper is an experimental study on the performance of the two major methods for macro-level similarity measurement: linear weighted merging and logical retrieval. Performance is measured as the average query execution time for a significant number of tests. The two models were implemented in the standard version (as they are applied in a number of prototypes) and in an optimized version. Th...
متن کاملPivot-based Metric Indexing
The general notion of a metric space encompasses a diverse range of data types and accompanying similarity measures. Hence, metric search plays an important role in a wide range of settings, including multimedia retrieval, data mining, and data integration. With the aim of accelerating metric search, a collection of pivotbased indexing techniques for metric data has been proposed, which reduces...
متن کاملMacroeconomic Policies and Increasing Social-Health Inequality in Iran
Background Health is a complex phenomenon that can be studied from different approaches. Despite a growing research in the areas of Social Determinants of Health (SDH) and health equity, effects of macroeconomic policies on the social aspect of health are unknown in developing countries. This study aimed to determine the effect of macroeconomic policies on increasing of the social-health inequa...
متن کامل